FACE RECOGNITION METHOD,RECORDING MEDIUM THEREOF 
AND FACE RECOGNITION DEVICE 

BACKGROUND OF THE INVENTION 
The present invention relates to a face recognition method for 
collating a face-image inputted from an image input device with face 
shapes and face image data converted into database and a recording 

5 medium storing the face recognition method as a program. 
Description of the Related Art 

So far, in the case of searching a specific object face image from 
database storing face images, manual detection in which the object face 
image is simply put beside a display and compared with the face images 

10 displayed one by one from the database to search the identical face image 
has been performed. 

In the conventional style, generally, a large number of human 
faces are shot from plural directions respectively and converted into 
database with classification items such as the person's name, sex and age. 

15 In the following, it will be called a database face image. At the stage of 
detection and collation, if a face image name etc. of which are unknown is 
obtained, the object database is narrowed down by characteristics such 
as sex and age, pictures or image data are displayed on a screen to be 
compared manually with the enquiry face image, and the identification of 

20 the images is judged by a man. On this occasion, characteristic points 
of the enquiry face image are uniformly grasped as imagery. When a 
face image approximate to the imagery is found, it is specified as an 
approximate database face image, and if it approximates closely, the 
enquiry face image and the database face image are compared carefully. 

25 Although it is improbable that both the images are exactly identical, they 
can be regarded as almost the same allowing for predicable change of the 
face image, and often judged to be the face images of the same person. 



These recent years, an attempt to automate the 
above-mentioned process has been made, in which the database face 
image and the enquiry face image are compared, collated or detected as 
two-dimensional image data. As the method, a facial part is 
automatically cut out from the background of the enquiry face image, 
and its size, brightness, contrast and so on are normalized so as to be 
equal to those of the database face image. Then coincidence degree of 
both the images is estimated by placing them, or parts of them, one on 
the other etc., and a pair showing high coincidence degree is judged to be 
images of the same person. The method is realized with high precision 
when the database face image and the enquiry face image are shot in the 
similar settings by the similar devices, and also an object person wears 
almost the same expression at the shooting. 

The method described above is basically a method for 
estimating coincidence degree of contrast between two-dimensional 
images. However, a human face has essentially a three-dimensional 
shape, therefore even if the expressions of the object person are almost 
the same, the figure and the contrast of the face images differ depending 
on angles of the face or lighting conditions. Consequently, the images of 
the different persons shot at the same angles in the same lighting 
conditions, rather than the images of the same person shot at the 
different angles in the different lighting conditions, tend to indicate the 
higher coincidence degree. In order to avoid the problem, it can be 
considered to have the images corresponding to conditions regarding all 
combinations of angles and lighting for the same person as database. 
However, in this case, the huge amount of data is needed. Thus a 
system capable of expressing a three-dimensional shape by small amount 
of data and collating it with a two-dimensional enquiry image which does 
not demand strict shooting conditions, considering practicality has come 
into request. 
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SUMMARY OF THE INVENTION 
It is therefore an object of the present invention to provide a 
face recognition system and an object recognition system which overcome 
the limit of the conventional recognition system implemented by 

5 comparing two-dimensional images. 

In accordance with the first aspect of the present invention, for 
achieving the above object, there is provided a face recognition method 
for identifying a person by detecting a three-dimensional shape of the 
person's face and a surface image thereof and collating the 

10 three-dimensional shape of the face and the surface image with enquiry 
face image data, comprising the steps of estimating shooting conditions 
of the enquiry face image data, generating referential face image data by 
the shooting conditions and the three-dimensional shape and its surface 
image using a graphics means, comparing the referential face image data 

15 to the enquiry face image data, and identifying the person of the enquiry 
face image data with the person of the referential face image data when a 
difference between both the face image data is small. 

In accordance with the second aspect of the present invention, 
the face recognition method having database which stores 

20 three-dimensional shapes of human faces and surface images thereof for 
identifying a person by collating the three-dimensional shape of the 
specific person's face and the surface image stored in the database with 
enquiry face image data, comprises the steps of estimating shooting 
conditions of the enquiry face image data, generating referential face 

25 image data by the shooting conditions and the three-dimensional shape 
and its surface image stored in the database using a graphics means, 
comparing the referential face image data to the enquiry face image data, 
and identifying the person of the enquiry face image data with the person 
of the referential face image data when a difference between both the 

30 face image data is small. 



In accordance with the third aspect of the present invention, 
there is provided an object recognition method for identifying an object 
by detecting a three-dimensional shape of the object and a surface image 
thereof and collating the three-dimensional shape of the object and the 
surface image with enquiry image data, comprising the steps of 
estimating shooting conditions of the enquiry image data, generating 
referential image data by the shooting conditions and the 
three-dimensional shape and its surface image using a graphics means, 
comparing the referential image data to the enquiry image data, and 
identifying the object of the enquiry image data with the object of the 
referential image data when a difference between both the image data is 
small. 

In accordance with the fourth aspect of the present invention, 
the object recognition method having database which stores 
three-dimensional shapes of a large number of objects and surface 
images thereof for identifying an object by collating the 
three-dimensional shape of the specific object and the surface image 
stored in the database with enquiry image data, comprises the steps of 
estimating shooting conditions of the enquiry image data, generating 
referential image data by the shooting conditions and the 
three-dimensional shape and the surface image stored in the database 
using a graphics means, comparing the referential image data to the 
enquiry image data, and identifying the object of the enquiry image data 
with the object of the referential image data when a difference between 
both the image data is small. 

In accordance with the fifth aspect of the present invention, 
there is provided a face recognition device for identifying a person by 
detecting a three-dimensional shape of the person's face and a surface 
image thereof and collating the three-dimensional shape of the face and 
the surface image with enquiry face image data, comprising an image 



input means for obtaining the enquiry face image by specifying shooting 
conditions of the enquiry face image data, a graphics means for 
generating referential face image data by the shooting conditions and the 
three-dimensional shape and its surface image, and an image collating 
means for comparing the referential face image data with the enquiry 
face image data to evaluate the data, and identifying the person of the 
enquiry face image data with the person of the referential face image 
data when a difference between both the face image data is small. 

In accordance with the sixth aspect of the present invention, the face 
recognition device having database which stores three-dimensional 
shapes of a large number of human faces and surface images thereof for 
identifying a person by collating contents of the database with enquiry 
face image data, comprises a condition input means for specifying 
shooting conditions of the enquiry face image data to input, the database 
for storing the three-dimensional shape data of faces and the surface 
image data, a graphics means for generating referential face image data 
by the shooting conditions and the three-dimensional shape data and its 
surface image data stored in the database, an image input means for 
obtaining the enquiry face image data, and a image collating means for 
comparing and collating the referential face image data with the enquiry 
face image data, and identifies the person of the enquiry face image data 
with the person of the referential face image data when a difference 
between both the face image data is small. 

In accordance with the seventh aspect of the present invention, the 
object recognition device having database which stores three-dimensional 
shapes of a large number of objects and surface images thereof for 
identifying an object by collating contents of the database with enquiry 
image data, comprises a condition input means for specifying shooting 
conditions of the enquiry image data to input, the database for storing 
the three-dimensional shape data of objects and the surface image data, 
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a graphics means fox generating referential image data by the shooting 
conditions and the three-dimensional shape data and its surface image 
data stored in the database, an image input means for obtaining the 
enquiry image data, and a image collating means for comparing and 
5 collating the referential image data with the enquiry image data, and 
identifies the object of the enquiry image data with the object of the 
referential image data when a difference between both the image data is 
small. 

In accordance with the eighth aspect of the present invention, 

10 there is provided a recording medium storing a program being readable 
by a computer to operate a recognition method having database which 
stores three-dimensional shapes of a large number of human faces and 
surface images thereof for identifying a person by collating contents of 
the database with enquiry face image data, wherein the program 

15 comprises the steps of estimating shooting conditions of the enquiry face 
image data, generating referential face image data by the shooting 
conditions and the three-dimensional shape and its surface image stored 
in the database using a graphics means, comparing and collating the 
referential face image data to the enquiry face image data, and 

20 identifying the person of the enquiry face image data with the person of 
the referential face image data when a difference between both the face 
image data is small. 

In accordance with the ninth aspect of the present invention, the 
recording medium stores a program being readable by a computer to 

25 operate a recognition method having database which stores 
three-dimensional shapes of a large number of objects and surface 
images thereof for identifying an object by collating contents of the 
database with enquiry image data, wherein the program comprises the 
steps of specifying shooting conditions of the enquiry image data to input, 

30 generating referential image data by the three-dimensional shape and its 
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surface image stored in the database using a graphics means, obtaining 
the enquiry image data by an image input means, comparing and 
collating the referential image data to the enquiry image data, and 
identifying the object of the enquiry image data with the object of the 
5 referential image data when a difference between both the image data is 
small. 

In accordance with the tenth aspect of the present invention, the 
recording medium stores a program being readable by a computer to 
operate a recognition method for identifying an object by detecting a 

10 three-dimensional shape of the object and a surface image thereof and 
collating the three-dimensional shape of the object and the surface image 
with enquiry image data, wherein the program comprises the steps of 
estimating shooting conditions of the enquiry image data, generating 
referential image data by the shooting conditions and the 

15 three-dimensional shape and its surface image using a graphics means, 
comparing and collating the referential image data with the enquiry 
image data, and identifying the object of the enquiry image data with the 
object of the referential image data when a difference between both the 
image data is small. 

20 

BRIEF DESCRIPTION OF THE DRAWINGS 
The objects and features of the present invention will become 
more apparent from the consideration of the following detailed 
description taken in conjunction with the accompanying drawings in 
25 which: 

Fig. 1 is a block diagram showing constitution of a face image 
recognition system according to an embodiment of the present invention,' 

Fig. 2 is a diagram explaining an example of three-dimensional 
shape obtaining means of the present invention; 
30 Fig. 3 is a diagram explaining pixel value obtained for each 
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pixel of a video camera by the example of three-dimensional shape 
obtaining means of the present invention; 

Fig. 4 is a diagram explaining a method of deciding an angle of 
face of an enquiry face image by a condition input means of the present 
5 invention; 

Fig. 5 is a diagram explaining a method of deciding a direction 
of lighting of an enquiry face image by a condition input means of the 
present invention; 

Fig. 6 is a diagram explaining the Phong's model for generating 
10 a referential color image by a graphics means of the present invention; 

Fig. 7 is a block diagram showing typical constitution of an 
image collating means according to the present invention; and 

Fig. 8 is a diagram explaining a method of collating images by 
the image collating means according to the present invention. 

15 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

Referring now to the drawings, a description of a preferred 
embodiment of the present invention will be given in detail. 

An embodiment of a face recognition system according to the 

20 present invention will be explained referring to Fig. 1. Using a 
three-dimensional shape measuring means 1, three-dimensional shapes 
and color images of various faces 2 which are objects of detection are 
obtained and stored in database means 5 as a shape data 3 and a color 
image data 4 following a prescribed expressing pattern. In this case, 

25 shooting of the color image data 4 is implemented being lighted from 
every direction so that each pixel value of the obtained images 
approximates reflectance ratio of skin. Incidentally, it is also possible 
to convert the pixel values of the obtained images into the pixel value of 
the reflectance ratio using the method as estimating an influence of the 

30 lighting in the obtained color image and eliminating the influence. 



Every face facing left, right, up and down direction is inputted in as the 
color image data corresponding to elements of the three primary colors: 
red (R), green (G) and blue (b), or using parameters of color capable of 
indicating chromaticity, luminosity and saturation. Besides, the shape 
data 3 is shot from front, left, right and back, and shapes of contour, hair, 
eyes, nose, mouth, eyebrows and so on are stored as database for every 
face facing left, right, up and down. 

An enquiry image 6 is inputted by an image input means 7. 
A video camera etc. is used as the image input means 7. Although a 
high precision camera having three million pixels is fine, such high 
precision is unnecessary and a camera of one million pixels capable of 
grasping characteristic points of the face image is enough. 

The user inputs shooting conditions 8 (an angle or a direction 
of the face and a lighting condition) of the image by a condition input 
means 9. A keyboard, a mouse and a touch board or the like are used 
as the condition input means 9. The lighting condition as well as the 
angle of the face are designated by designating keys of seven angles: 
front angle, left / right 90° and 45° angles, upper side 45° angle, lower 
side 45° angle and so forth, even though it is only three surfaces: front, 
right and left, needed to indicate the angle of the face. It is also 
possible to designate the angles by clicking a specific point on the 
display. 

A graphics means 10 generates a two-dimensional referential 
color image 11 with computer graphics method, using the 
three-dimensional shape data 3 and the color image data 4 stored in the 
database means 5 according to the shooting condition 8. In this event, 
it is preferable that the shooting condition is coincided with the shooting 
condition of the enquiry image 6. Thus the shooting condition should be 
designated following shooting situations rather than designated by a 
rough guess. 
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An image collating means 12 judges which face is probably the 
identical one with the enquiry image 6 by comparing the enquiry image 6 
with the referential color images 11 of each person, calculating 
coincidence degrees and comparing the coincidence degrees. The 

5 calculation of the coincidence degree will be described later. 

Referring now to the drawings, a description of each part of the 
present embodiment will be given in detail. Figs. 2 and 3 are showing 
the three-dimensional shape measuring means 1 for obtaining a 
three-dimensional shape and a color image of a face. As an example of 

10 the three-dimensional shape measuring means 1, constitution of "A 
Range Finder for Human Face Measurement" described in detail in the 
technical report of Institute of Electronics, Information and 
Communication Engineers, Pattern Recognition and Media 
Understanding Society (PRMU99-24) issued on June 1999 written by 

15 Ishiyama, Sakamoto et al., will be explained briefly. In the technical 
report, some examples of measuring method of a three-dimensional 
shape where a three-dimensional shape model is obtained using the 
range finder are introduced. Here, a phase-shift method using a 
sinusoidal lattice is selected from among other methods such as a 

20 spot-light / slit-light scanning method using a laser beam which requires 
a long measuring time, a space cord method using a high-speed pattern 
projection / camera device and so on. In the phase -shift method, a 
sinusoidal light-intensity pattern is projected as its phase is changing in 
time sequence, and space is coded by a phase value of light-intensity 

25 pattern at observed time sequence. The phase -shift method has 
characteristics such as needing only three images at least, being capable 
of measuring three-dimensional shape of every pixel, easily obtaining 
higher precision by adding pixels and numbers of the images, and being 
capable of measuring a color texture image simultaneously. 

30 In Fig. 2, a pattern of a phase lattice 103 is projected to a face 
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101 from a light pattern projector 102. The phase lattice 103 has 
spatially sinusoidal transmittance shown in Fig. 3, and as its position is 
changed, a plurality of images are inputted. Then, at a specific point on 
the face, intensity of projected light changes by time as shown in Fig. 3. 
5 When the scene is shot by a video camera 104, intensity of a reflected ray 
at the pixels (u, v) on the t-th image is obtained as the following equation 
(1). 

I(u, v, t)=Ibias(u, v)+A(u, v)cos( (j) +t) (1) 

10 

Here, Ibi as (u, v) expresses the reflected ray of a direct current component 
of an environmental light pattern and a projection light pattern, A(u, v) 
expresses intensity of the projection light pattern. 

Incidentally, 0 indicates an initial phase value of the first 
15 image of a projected sinusoidal wave pattern at the position. In Fig. 3, 
the initial phase value $ is set about at 120°. The initial phase value <f> 
in each pixel is calculated by an image processing means 105 using 
inputted images according to following equations (2) to (5). 

I ~ I ha* + A (cos <f> cos t - sin ^ sin t) (2) 

J 2 * / cos tdt = A cos <f> j 2 * cos 2 tdt - C cos <j> (3) 

J o * I sin tdt = - A sin (f> £ * sin 2 tdt = -C sin <j> (4) 

- \ 2n I sin tdt 

0(«,v) = tan - 1 -A (5 ) 

/ cos tdt 

Jo 

20 

Since the initial phase value $ closely relates to a projection 
angle a shown in Fig. 2, when the 4> in each pixel is obtained, the 
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projection angle a is defined. Besides, an incident angle 3 to the 
video camera 104 is obtained from the position of the pixel (u, v). A 
positional relationship between a projection point of the sinusoidal wave 
pattern and a camera has been decided. Thus the two points, the 

5 projection angle a and the incidence angle jS of the video camera 104 
are obtained, a triangle is decided uniquely, and thereby distance from 
the video camera 104 to an object in each pixel is obtained. Finally, a 
three-dimensional coordinates (X, Y, Z) of a face surface for each pixel is 
obtained by setting an origin at an appropriate position. Thereby the 

10 shape data 3 of the face surface is generated by generally known 
technique such as comprising a triangle patch by combining the obtained 
three dimensional coordinates with adjacent others. 

Moreover, the color image data 4 of the object face is obtained 
from a plurality of images by following equation (6). 

15 

J fc „(«,v) = ^r I{u,vj)dt (6) 

2k jo 

The foregoing is described in the above-described technical 
report. Also in the present invention, the shape data 3 and the color 
image data 4 of the face image are obtained by the same process by the 
20 three-dimensional shape obtaining means 1 according to the above 
equations. 

Referring to Fig. 4, an example of operations by the condition 
input means 9 and the graphics means 10 will be descried. A display 
monitor- Al 3 is a monitor for displaying the enquiry image, provided to 
25 the face recognition system to display situations of the condition input 
means 9 and the system. 

In Fig. 4, since the enquiry image 6 is generally incapable of 
defining an angle of a face or a position of lighting and a camera, it is not 
shot being lighted from every direction as the color image data stored in 
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the database means 5. Consequently, the image cannot be collated with 
the data stored in the database means 5 directly. In order to realize the 
collation, the shooting conditions 8 of the enquiry image is inputted in 
the graphics means 10 by the condition input means 9, and thereby the 

5 referential color image 11 is generated using the stored shape data 3 and 
color image data 4. 

The shooting condition 8 basically means an angle of a face and 
lighting directions. First, the face's angle is defined. According to the 
present embodiment, the face's angle is decided by comparing the display 

10 monitor-A13 showing the enquiry image 6 with a display monitor-B14 
showing images at standard angles and selecting one by a user. A 
model example showing five varieties of face images: facing "forward", 
"right", "left", "downward" and "upward", on the display monitor-B14 is 
shown in Fig. 4. The user selects the closest angle by input keys as he / 

15 she is comparing the enquiry image and the images at standard angles. 
Although in the example of Fig. 4, the "left angle" is selected from among 
the five varieties of images, it is desirable to select the angle as precise as 
possible by comparing the enquiry image with images at dozens of angles 
in order to obtain higher collating precision. Incidentally, the shooting 

20 condition 8 can be designated by estimating a standard shooting 
condition when the shooting condition of the enquiry image 6 cannot be 
specified. 

On the occasion of storing the shape data 3 and the color image 
data 4 in the database means 5, characteristic points such as "position(s) 

25 of pupil(s)" and "a center of a mouth" are supplied manually and the 
positions of the points are also stored in the database means 5. 
Similarly, as for the enquiry image 6, the angle can be decided by 
supplying the above characteristic points manually and comparing the 
positions of the characteristic points of both the images. 

30 Next, the lighting direction is defined. In this event, the 
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lighting direction is also decided by comparing standard lighting images 
corresponding to the above decided face's angle displayed on the display 
monitor-B14 with the enquiry image 6 and selecting one by the user. In 
an example of Fig. 5, five varieties of images of standard lighting 
directions: "forward", "right", "left", "downward" and "upward", for the 
face facing forward are shown. The user selects the closest lighting by 
input keys as he / she is comparing the enquiry image 6 and the standard 
lighting images. Although in the example of Fig. 5, the lighting 
direction is selected from among the five varieties of images, it is 
desirable to select the lighting direction as precise as possible by 
comparing the enquiry image with dozens of standard lighting images in 
order to obtain higher collating precision. Incidentally, a known 
method for estimating the lighting direction automatically from the 
image is applicable. 

Besides, in the above example of the five varieties of images, 
the lighting direction is only one, however, there can be the lighting from 
two or three directions. In this case, it is desirable to request the 
selection of the shooting conditions for a plurality of lighting directions to 
be displayed on the display monitor and select ones by key-input. 
Furthermore, it is also desirable to set variations in intensity of 
illuminance, for example strong, medium and weak illumination 
intensity, and select one. 

As a result of the foregoing, the angle of the face and the 
lighting direction are decided. Using the shooting conditions 8, and the 
shape data 3 and the color image data 4 stored in the database means 5, 
the referential color image data 11 is generated by the graphics means 10. 
As for the method to generate the referential color image data 11, a 
generally known method may be used. For example, the shape data 3 
expresses only characteristic parts vicinity of eyes and a nose as a 
surface shape of a face by a group of fine triangle patches 61 as shown in 
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Fig. 6. An angle of whole surface of the face to a camera is modified 
according to the angle obtained by the above process. A normal vector n 
of the surface is obtained from information of the triangle patches 61 
whose direction has been modified. Showing one of the triangle patches 

5 in Fig. 6 as an example, when a side of the nose is defined as an object 
area which is an only flat surface surrounded by vertices A, B and C of a 
triangle, an incident vector s to the triangle patch 61 is obtained from the 
lighting direction obtained by the above process. When the direction of 
the camera is given as an viewing vector v, the value (R, G, B> incoming 

10 to the camera as the color of the triangle patch 61 is expressed as the 
following equation (7) using a well known reflection model of Phong. 
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Here, the (Ro, Go, Bo) t expresses a longitudinal matrix of a transposed 
matrix (Ro, Go, Bo), and indicates the color value of diffuse reflection 

15 components of an object surface, in which the color image data 4 stored 
in the database means 5 may be used. 

Incidentally, (Rw, Gw, Bw) t is a value indicating the color of 
a specular reflection component of the object surface, 0 is an angle the 
normal vector n and the incident vector s form and (p is an angle a 

20 specular reflection direction p and the ejection vector v form. Moreover, 
d is a constant indicating the proportion of a part due to a luminous 
source and a part due to environmental light in the diffuse reflectance 
component which is determined by a character of the surface. 

The final collation of the enquiry image 6 and the referential 

25 color image 11 is executed by the image collating means 12. With a 
recognition method of two-dimensional images which is generally known 
so far, a variety of embodiments can be realized. A typical embodiment 
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is shown in Fig. 7. The face images of the enquiry image 6 and the 
referential color image 11 are normalized respectively by operating 
position / size / luminosity normalizing means 121 and 122 on them. In 
the normalization, positions of both pupils 111 and 112, in the enquiry 
face image 6 for example, are detected, and then parallel displacement, 
rotation, magnification and minimization of the image are executed so 
that the detected positions come to standard pupil positions 114 and 115 
in a standard face image 113 as shown in Fig. 8. Although it is possible 
to detect the pupil positions in the image beforehand as the user is 
observing a display means, an automatic image processing method is also 
usable. As described above, having the standard face image 113 as a 
basis, the pupil positions are matched, then differences of width of pupils, 
width of noses, positions of the noses, width of mouths, positions of the 
mouths, width of upper Hps and lower lips, width of ears, positions of the 
ears, length of the ears and the like are obtained and thereby normalized. 
As for the normalizing method, a method described in "Processing Face 
Image for Newspaper" on pages I I -397-403 of "Meeting on Image 
Recognition and Understanding (MIRU'98)" (Information Processing 
Society of Japan Symposium Series) written by Sakamoto, Tajima et al. 
is usable. 

Moreover, as the standard face image 113, either the enquiry 
face image 6 or the referential color image 11 can be used. 

Furthermore, as for the luminosity, a comparing area 116 is 
defined based on the position of the pupils shown in Fig. 8, and 
distributions of pixel values R, G, B in the comparing area 116 of both 
the enquiry face image 6 and the referential color image 11 are 
calculated. For example, an average value R a and a standard deviation 
a r of distribution of the pixel value R in the comparing area 116 of the 
enquiry image 6 are obtained. Then, each pixel value is converted into 
R' so as to match with a predetermined average value R a0 and standard 
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deviation a r o of a standard pixel value R in the comparing area of a face 
by the following equation (8). The element values G, and B are 
processed likewise. 

R 1 =(R-R a )^L+R a0 (8) 

<T r 

5 A normalized enquiry image 123 and a normalized referential 

color image 124 obtained thus are compared by an image comparing 
means 125, and the difference between the images is calculated as the 
following equation (9), thereby the rth person whose difference Di 2 is 
minimum is supposed to be identical with the person of the enquiry 

10 image 6 and outputted. 

D i = T« R '( x >y)- R \ ( x >y)) 2 +(G'(x,y)-G\ (x,y)f + (B'(x,y)-B\ (x,y)) 2 } (9) 

In the equation (9), R'(x, y), G'(x, y), B'(x, y) are the pixel 
15 values of the normalized enquiry image 123 and R'i(x, y), G'i(x, y), B'i(x, y) 
are the pixel values of the normalized referential color image 124. 

The minimum difference Di 2 means, in other words, the highest 
similarity and the similarity is in proportion to the inverse number of the 
difference Di 2 . Thus, the similarity between the enquiry image and the 
20 referential color image which is graphically generated using the 
three-dimension shape data and the color image data in the database can 
be obtained. 

Incidentally, while the above embodiment has been described 
using color images, it can be realized likewise using monochrome gray 
25 images. 

As is explained above, one of the referential color images 11 of 
a large number of the face shape and image data converted into the 
database and the enquiry image 6 can be judged to be identical by the 
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image collating means 12 if the images coincide. 

While the above embodiment has been described mainly about 
face images, the present invention is also applicable to things other than 
human faces* characteristic buildings and historic sites; nature such as 
5 mountains, rivers and the sea; animals and plants, in common in point of 
collating. 

Moreover, when a computer reads the face recognition method 
as a program from a readable recording medium, the computer operates 
according to the program, therefore it is possible to transmit the program 

10 by transmitting media of the Internet or the e-mail style as well as 
expand the program. Furthermore, the object of the program is not 
limited to the collating method, the recording medium including 
database is good as well. An effective face collation can be executed by 
accumulating a large number of three-dimensional shapes of the face 

15 images and surface images thereof. 

With the present invention explained above, it is made possible 
to constitute a system which detects and collates an enquiry face image 
precisely with face shape and image data converted into database. 

Moreover, the above constitution also can be realized as a 

20 computer program using a computer system. It is made possible to 
provide a face recognition system having the same effects as the above 
embodiment by storing the computer program in a recording medium 
such as a package software etc., and installing it with database in a 
personal computer, a super computer and so forth. 

25 While the preferred embodiment of the present invention has 

been described using specific terms, such description is for illustrative 
purposes only, and it is to be understood that changes and variations 
may be made without departing from the spirit or the scope of the 
following claims. 

30 



